MERS exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Turtle: identifying frequent k-mers with cache-efficient algorithms.

Internal identifier: 001A35 (Main/Exploration); previous: 001A34; next: 001A36

Authors: Rajat Shuvro Roy [United States]; Debashish Bhattacharya [United States]; Alexander Schliep [United States]

Source: Bioinformatics (Oxford, England), 2014

RBID : pubmed:24618471

Abstract

Counting the frequencies of k-mers in read libraries is often a first step in the analysis of high-throughput sequencing data. Infrequent k-mers are assumed to be a result of sequencing errors. The frequent k-mers constitute a reduced but error-free representation of the experiment, which can inform read error correction or serve as the input to de novo assembly methods. Ideally, the memory requirement for counting should be linear in the number of frequent k-mers and not in the, typically much larger, total number of k-mers in the read library.
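To make the memory argument concrete, here is a minimal sketch of a naive exact k-mer counter. It is not the Turtle method; it is only the straightforward baseline the abstract argues against, since the table holds one entry per distinct k-mer in the library, including the rare ones that are presumed to be sequencing errors. The function names, the toy reads, and the `min_count` threshold are assumptions made for illustration, and reverse complements are deliberately ignored.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer in the reads (naive and exact).

    Memory grows with the number of distinct k-mers seen,
    not with the number of frequent ones.
    """
    counts = Counter()
    for read in reads:
        # Slide a window of width k over the read.
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def frequent_kmers(counts, min_count=2):
    """Keep only k-mers seen at least min_count times (hypothetical threshold)."""
    return {kmer: n for kmer, n in counts.items() if n >= min_count}


# Toy reads, invented for this example.
reads = ["ACGTACGT", "CGTACGTA", "ACGTTCGT"]
counts = count_kmers(reads, k=4)
frequent = frequent_kmers(counts, min_count=2)

print(len(counts), "distinct k-mers stored")
print(len(frequent), "frequent k-mers kept")
```

On these toy reads the counter stores twice as many entries as survive the threshold. On real libraries the gap is typically much larger, which is why the abstract asks for memory linear in the number of frequent k-mers only.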

DOI: 10.1093/bioinformatics/btu132
PubMed: 24618471

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Turtle: identifying frequent k-mers with cache-efficient algorithms.</title>
<author>
<name sortKey="Roy, Rajat Shuvro" sort="Roy, Rajat Shuvro" uniqKey="Roy R" first="Rajat Shuvro" last="Roy">Rajat Shuvro Roy</name>
<affiliation wicri:level="4">
<nlm:affiliation>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901</wicri:regionArea>
<placeName>
<region type="state">New Jersey</region>
<settlement type="city">New Brunswick (New Jersey)</settlement>
</placeName>
<orgName type="university">Université Rutgers</orgName>
</affiliation>
</author>
<author>
<name sortKey="Bhattacharya, Debashish" sort="Bhattacharya, Debashish" uniqKey="Bhattacharya D" first="Debashish" last="Bhattacharya">Debashish Bhattacharya</name>
<affiliation wicri:level="4">
<nlm:affiliation>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901</wicri:regionArea>
<placeName>
<region type="state">New Jersey</region>
<settlement type="city">New Brunswick (New Jersey)</settlement>
</placeName>
<orgName type="university">Université Rutgers</orgName>
</affiliation>
</author>
<author>
<name sortKey="Schliep, Alexander" sort="Schliep, Alexander" uniqKey="Schliep A" first="Alexander" last="Schliep">Alexander Schliep</name>
<affiliation wicri:level="4">
<nlm:affiliation>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901</wicri:regionArea>
<placeName>
<region type="state">New Jersey</region>
<settlement type="city">New Brunswick (New Jersey)</settlement>
</placeName>
<orgName type="university">Université Rutgers</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2014">2014</date>
<idno type="RBID">pubmed:24618471</idno>
<idno type="pmid">24618471</idno>
<idno type="doi">10.1093/bioinformatics/btu132</idno>
<idno type="wicri:Area/PubMed/Corpus">001A30</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001A30</idno>
<idno type="wicri:Area/PubMed/Curation">001A30</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001A30</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001695</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">001695</idno>
<idno type="wicri:Area/Ncbi/Merge">000D13</idno>
<idno type="wicri:Area/Ncbi/Curation">000D13</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000D13</idno>
<idno type="wicri:Area/Main/Merge">001A40</idno>
<idno type="wicri:Area/Main/Curation">001A35</idno>
<idno type="wicri:Area/Main/Exploration">001A35</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Turtle: identifying frequent k-mers with cache-efficient algorithms.</title>
<author>
<name sortKey="Roy, Rajat Shuvro" sort="Roy, Rajat Shuvro" uniqKey="Roy R" first="Rajat Shuvro" last="Roy">Rajat Shuvro Roy</name>
<affiliation wicri:level="4">
<nlm:affiliation>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901</wicri:regionArea>
<placeName>
<region type="state">New Jersey</region>
<settlement type="city">New Brunswick (New Jersey)</settlement>
</placeName>
<orgName type="university">Université Rutgers</orgName>
</affiliation>
</author>
<author>
<name sortKey="Bhattacharya, Debashish" sort="Bhattacharya, Debashish" uniqKey="Bhattacharya D" first="Debashish" last="Bhattacharya">Debashish Bhattacharya</name>
<affiliation wicri:level="4">
<nlm:affiliation>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901</wicri:regionArea>
<placeName>
<region type="state">New Jersey</region>
<settlement type="city">New Brunswick (New Jersey)</settlement>
</placeName>
<orgName type="university">Université Rutgers</orgName>
</affiliation>
</author>
<author>
<name sortKey="Schliep, Alexander" sort="Schliep, Alexander" uniqKey="Schliep A" first="Alexander" last="Schliep">Alexander Schliep</name>
<affiliation wicri:level="4">
<nlm:affiliation>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901, USA.</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, Department of Ecology, Evolution and Natural Resources, Institute of Marine and Coastal Sciences and BioMaPS Institute for Quantitative Biology, Rutgers University, New Brunswick, NJ 08901</wicri:regionArea>
<placeName>
<region type="state">New Jersey</region>
<settlement type="city">New Brunswick (New Jersey)</settlement>
</placeName>
<orgName type="university">Université Rutgers</orgName>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Bioinformatics (Oxford, England)</title>
<idno type="eISSN">1367-4811</idno>
<imprint>
<date when="2014" type="published">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithms</term>
<term>Genome, Human</term>
<term>High-Throughput Nucleotide Sequencing (methods)</term>
<term>Humans</term>
<term>Sequence Analysis, DNA (methods)</term>
<term>Software</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de séquence d'ADN (méthodes)</term>
<term>Génome humain</term>
<term>Humains</term>
<term>Logiciel</term>
<term>Séquençage nucléotidique à haut débit (méthodes)</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>High-Throughput Nucleotide Sequencing</term>
<term>Sequence Analysis, DNA</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Algorithms</term>
<term>Genome, Human</term>
<term>Humans</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de séquence d'ADN</term>
<term>Génome humain</term>
<term>Humains</term>
<term>Logiciel</term>
<term>Séquençage nucléotidique à haut débit</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Counting the frequencies of k-mers in read libraries is often a first step in the analysis of high-throughput sequencing data. Infrequent k-mers are assumed to be a result of sequencing errors. The frequent k-mers constitute a reduced but error-free representation of the experiment, which can inform read error correction or serve as the input to de novo assembly methods. Ideally, the memory requirement for counting should be linear in the number of frequent k-mers and not in the, typically much larger, total number of k-mers in the read library.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>New Jersey</li>
</region>
<settlement>
<li>New Brunswick (New Jersey)</li>
</settlement>
<orgName>
<li>Université Rutgers</li>
</orgName>
</list>
<tree>
<country name="États-Unis">
<region name="New Jersey">
<name sortKey="Roy, Rajat Shuvro" sort="Roy, Rajat Shuvro" uniqKey="Roy R" first="Rajat Shuvro" last="Roy">Rajat Shuvro Roy</name>
</region>
<name sortKey="Bhattacharya, Debashish" sort="Bhattacharya, Debashish" uniqKey="Bhattacharya D" first="Debashish" last="Bhattacharya">Debashish Bhattacharya</name>
<name sortKey="Schliep, Alexander" sort="Schliep, Alexander" uniqKey="Schliep A" first="Alexander" last="Schliep">Alexander Schliep</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001A35 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001A35 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     pubmed:24618471
   |texte=   Turtle: identifying frequent k-mers with cache-efficient algorithms.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:24618471" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021